arXiv:math/0503521 vl [math.PR] 24 Mar 2005 


The Annals of Applied Probability 

2005, Vol. 15, No. IB, 914-940 

DOI: 10.1214/105051604000000774 

(c) Institute of Mathematical Statistics, 2005 

ASYMPTOTICS IN RANDOMIZED URN MODELS 

By Zhi-Dong Bai^ and Feifang Hu^ 

Northeast Normal University and National University of Singapore, and 

University of Virginia 

This paper studies a very general urn model stimulated by designs 
in clinical trials, where the number of balls of different types added 
to the nrn at trial n depends on a random outcome directed by the 
composition at trials 1,2,..., n — 1. Patient treatments are allocated 
according to types of balls. We establish the strong consistency and 
asymptotic normality for both the urn composition and the patient 
allocation under general assumptions on random generating matrices 
which determine how balls are added to the urn. Also we obtain 
explicit forms of the asymptotic variance-covariance matrices of both 
the urn composition and the patient allocation. The conditions on the 
nonhomogeneity of generating matrices are mild and widely satisfied 
in applications. Several applications are also discussed. 

1. Introduction. In designing a clinical trial, the limiting behavior of 
the patient allocation to several treatments during the process is of primary 
consideration. Suppose patients arrive sequentially from a population. Adap¬ 
tive designs in clinical trials are inclining to assign more patients to better 
treatments, while seeking to maintain randomness as a basis for statistical 
inference. Thus the cumulative information of the responses of treatments 
on previous patients will be used to adjust treatment assignment to coming 
patients. For this purpose, various urn models [Johnson and Kotz (1977)] 
have been proposed and used extensively in adaptive designs [for more ref¬ 
erences, see Zelen (1969), Wei (1979), Flournoy and Rosenberger (1995) and 
Rosenberger (1996)]. 

One large family of randomized adaptive designs is based on the gen¬ 
eralized Friedman’s urn (GFU) model [Athreya and Karlin (1967, 1968), 


Received June 2003; revised March 2004. 

^Supported by NSFC Grant 201471000 and NUS Grant R-155-000-030-112. 

^Supported by NSF Grant DMS-0204232 and NUS Grant R-155-000-030-112. 

AMS 2000 subject classifications. Primary 62E20, 62L05; secondary 62F12. 

Key words and phrases. Asymptotic normality, extended Polya’s urn models, general¬ 
ized Friedman’s urn model, martingale, nonhomogeneous generating matrix, response- 
adaptive designs, strong consistency. 

This is an electronic reprint of the original article published by the 
Institute of Mathematical Statistics in The Annals of Applied Probability, 

2005, Vol. 15, No. IB, 914-940. This reprint differs from the original in pagination 
and typographic detail. 


1 




2 


Z.-D. BAI AND F. HU 


also called the generalized Polya urn (GPU) in the literature]. The model 
can be described as follows. Consider an urn containing balls of K types, 
respectively, representing K “treatments” in a clinical trial. These treat¬ 
ments are to be assigned sequentially in n stages. At the beginning, the urn 
contains Yq = (Yoij • • • j^oa) balls, where Yofc denotes the number of balls 
of type k, k = I,..., K. At stage i, i = 1,... ,n, a ball is randomly drawn 
from the urn and then replaced. If the ball is of type q, then the treatment 
q is assigned to the ith patient, q = 1,..., K, z = 1,... ,n. We then wait 
until we observe a random variable ^(z), which may include the response 
and/or other covariates of patient z. After that, an additional Dq^ii) balls 
of type k, k = 1,..., K, are added to the urn, where Dqk{i) is some func¬ 
tion of ^(z). This procedure is repeated throughout the n stages. After n 
splits and generations, the urn composition is denoted by the row vector 
Yn = (Y„i,..., Ynx), where represents the number of balls of type k in 
the urn after the rzth split. This relation can be written as the following 
recursive formula: 


Yji — Y^_i -|- 

where is the result of the nth draw, distributed according to the urn 
composition at the previous stage; that is, if the nth draw is a type-A: ball, 
then the A:th component of X„ is 1 and other components are 0. Furthermore, 
write N„ = {Nni, ■ ■ ■, NnK), where Nnk is the number of times a type-A; ball 
was drawn in the first n stages, or equivalently, the number of patients who 
receive the treatment k in the first n patients. 

For notation, let Dj = {{Dqk{i),q,k = and let Y) be the se¬ 
quence of increasing cr-fields generated by {Yj}*-^g, and 

Define Hj = {{E{Dqk{i)\J^i-i), q,k = 1,..., K)), z = 1,...,n. The matrices 
D* are called addition rules and H* generating matrices. In practice, the 
addition rule Dj often depends only on the treatment on the zth patient 
and its outcome. In these cases, the addition rules Dj are i.i.d. (independent 
and identically distributed) and the generating matrices Hj = H = UDj are 
identical and nonrandom. But in some applications, the addition rule Dj 
depends on the total history of previous trials [see Andersen, Faries and 
Tamura (1994) and Bai, Hu and Shen (2002)]; then the general generat¬ 
ing matrix Hj is the conditional expectation of Dj given Therefore, 

the general generating matrices {Hj} are usually random. In this paper, we 
consider this general case. Examples are considered in Section 5. 

A GFU model is said to be homogeneous if Hj = H for all z = 1,2,3,.... 
In the literature, research is focused on asymptotic properties of Y„ for ho¬ 
mogeneous GFU. First-order asymptotics for homogeneous GFU models are 
determined by the generating matrices H. In most cases, H is an irreducible 
nonnegative matrix, for which the maximum eigenvalue is unique and posi¬ 
tive (called the maximal eigenvalue in the literature) and its corresponding 
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left eigenvector has positive components. In some cases, the entries of H may 
not be all nonnegative (e.g., when there is no replacement after the draw), 
and we may assume that the matrix H has a unique maximal eigenvalue A 
with associated left eigenvector v = (ui,... ,vk) with Under the 

following assumptions: 


(i) Pr{Dqk =0,k = l,...,K} = 0 for every q = l,...,K, 

(ii) Dqk > 0 for all g. A; = 1,, K, 

(hi) H is irreducible, 

Athreya and Karlin (1967, 1968) prove that 


( 1 . 1 ) 



and 


^nk 


Y 

Z^g=l r-nq 


Vk 


almost surely as n —> oo. 

Let Ai be the eigenvalue with a second largest real part, associated with 
a right eigenvector If A > 2Re(Ai), Athreya and Karlin (1968) show that 


( 1 . 2 ) 




in distribution, where c is a constant. When A = 2Re(Ai) and Ai is simple, 
then (1.2) holds when is replaced by l/-\/nln(n). Asymptotic results 

under various addition schemes are considered in Freedman (1965), Mah¬ 
moud and Smythe (1991), Holst (1979) and Gouet (1993). 

Homogeneity of the generating matrix is often not the case in clinical 
trials, where patients may exhibit a drift in characteristics over time. Ex¬ 
amples are given in Altman and Royston (1988), Goad (1991) and Hu and 
Rosenberger (2000). Bai and Hu (1999) establish the weak consistency and 
the asymptotic normality of Y„ under GFU models with nonhomogeneous 
generating matrices Hj. [In that paper, it is assumed that Hj = EDj, so Hi 
are fixed (not random) matrices.] They consider the following GFU model 
(GFUl): Dgk{i) = ci > 0, for all g = 1,..., K and i = 1,..., n, the to¬ 

tal number of balls added at each stage is a positive constant. They assume 
there is a nonnegative matrix H such that 



where a* = ||Hi — H||oo. 

In clinical trials, represents the number of patients assigned to the 
treatment k in the first n trials. Doubtless, the asymptotic distribution and 
asymptotic variance of N„ = {Nni, ■ ■ ■, NnK) is of more practical interest 
than the urn compositions to sequential design researchers. As Athreya 
and Karlin [(1967), page 275] said, “It is suggestive to conjecture that 
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(Nni,..., Nnx) properly normalized is asymptotically normal. This prob¬ 
lem is open.” The problem has stayed open for decades due to mathematical 
complexity. One of our main goals of this paper is to present a solution to 
this problem. 

Smythe (1996) defined the extended Polya urn (EPU) (homogeneous) 
models, satisfying ^{Dqk) = ci > 0, g = 1,..., iP; that is, the expected 

total number of balls added to the urn at each stage is a positive constant. 
For EPU models, Smythe (1996) established the weak consistency and the 
asymptotic normality of Y„ and N„ under the assumptions that the eigen¬ 
values of the generating matrix H are simple. The asymptotic variance of 
Nn is a more important and difficult proposition [Rosenberger (2002)]. Re¬ 
cently, Hu and Rosenberger (2003) obtained an explicit relationship between 
the power and the variance of in an adaptive design. To compare the ran¬ 
domized urn models with other adaptive designs, one just has to calculate 
and compare their variances. Matthews and Rosenberger (1997) obtained 
the formula for asymptotic variance for the randomized play-the-winner rule 
{K = 2) which was initially proposed by Wei and Durham (1978). A general 
formula for asymptotic variance of was still an open problem [Rosen¬ 
berger (2002)]. 

In this paper, we 

(i) show the asymptotic normality of N„ for general H; 

(ii) obtain a general and explicit formula for the asymptotic variance of 

N„; 

(hi) show the strong consistency of both and N„; and 

(iv) extend these results to nonhomogeneous urn model with random 
generating matrices H,. 

The paper is organized as follows. The strong consistency of Y„ and N„ 
is proved in Section 2 for both homogeneous and nonhomogeneous EPU 
models. Note that the GFUl is a special case of EPU. The asymptotic nor¬ 
mality of Y„ for homogeneous and nonhomogeneous EPU models is shown 
in Section 3 under the assumption (1.3). We consider cases where the gen¬ 
erating matrix H has a general Jordan form. In Section 4, we consider the 
asymptotic normality of = {Nni, ■ ■ ■, Nuk) for both homogeneous and 
nonhomogeneous EPU models. Further, we obtain a general and explicit 
formula for the asymptotic variance of N„. 

The condition (1.3) in a nonhomogeneous urn model is widely satisfied in 
applications. In some applications [e.g., Bai, Hu and Shen (2002)], the gen¬ 
erating matrix Hj may be estimates of some unknown parameters updated 
at each stage, for example, Hj at ith stage. In these cases, we usually have 
ai = 0(i“^/^) in probability or almost surely, so the condition (1.3) 

is satisfied. Also (1.3) is satisfied for the case of Hu and Rosenberger (2000). 
Some other applications are considered in Section 5. 
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2. Strong consistency of and N„. Using the notation defined in the 
Introduction, is a sequence of random iC-vectors of nonnegative elements 
which are adaptive with respect to satisfying 

(2.1) U(Yi|^,_i)=Y,_iM„ 

where Mj = I + Hj = E{T)i\Ti-i) and a* = Without loss 

of generality, we assume oq = 1 in the following study. 

In the sequel, we need the following assumptions. 

Assumption 2.1. The generating matrix Hj satisfies 
Hqk{i) > 0 for all k,q and 

( 2 . 2 ) 


K 


for all g = 1,..., A, 


^ Cl 

k=l 

almost surely, where Hqk(i) is the (g, A:)-entry of the matrix Hj and ci is a 
positive constant. Without loss of generality, we assume ci = 1 throughout 
this work. 

Assumption 2.2. The addition rule Dj is conditionally independent of 
the drawing procedure Xj given Ei-i and satisfies 

(2.3) {i)\Ei_i) <C < oo for all g. A: = 1,..., A and some d > 0. 
Also we assume that 

(2.4) coY[{Dqk{i),Dqi{i))\Ei-i]^ dqki for all g,A;,^ = 1,...,A, 

where dq = {dqki)ki=i, Q = K, are some K x K positive definite ma¬ 
trices. 

Remark 2.1. Assumption 2.1 defines the EPU model [Smythe (1996)]; 
it ensures that the number of expected balls added at each stage is a positive 
constant. So after n stages, the total number of balls, an, in the urn should 
be very close to n [anln converges to 1). 

The elements of the addition rule are allowed to take negative values in 
the literature, which corresponds to the situation of withdrawing balls from 
the urn. But, to avoid the dilemma that there are no balls to withdraw, 
only diagonal elements of Dj are allowed to take negative values, which 
corresponds to the case of drawing without replacement. 

To investigate the limiting properties of Y„, we first derive a decomposi¬ 
tion. From (2.1), it is easy to see that 

Yn = {Yn - A(Y„|J-„_i)) + Y„_iM„ 

= Qn + Yn-lGn + Yn-l(]VIn — Gn) 

(2.5) 


= Y0G1G2 • • • G„ + ^ Q,B„,j + ^ Y,_i(M, - Gj)B„,j 
= Si -I- S 2 -F S 3 , 


1=1 


2=1 
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where Qj = Yj - E'(Yj| J'j.i), G* = I + and = Gj+i • • • G„ with 
the convention that = I and Tq denotes the trivial cr-field. 

We further decompose S3 as follows: 

n 

S3 = E 

=Ea-\Yi-i(H,-H)B„,i + 

1 1 Uy7 I V 

2=1 2=1 ^ ^ 

= S31 + S32. 

To estimate the above terms in the expansion, we need some preliminary 
results. First, we evaluate the convergence rate of a„. To this end, we have 
the following theorem. 

Theorem 2.1. Under Assumptions 2.1 and 2.2, (a) anjn^l a.s. as 
n 00 , and (b) n~^{an — n) ^ 0 a.s. for any k > 1/2. 

Proof. Let ei = Oi — ai-i for z > 1. By definition, we have e* = XjDjl, 
where X* is the result of the zth draw, multinomially distributed according 
to the urn composition at the previous stages; that is, the conditional prob¬ 
ability that the ith draw is a ball of type k (the kih. component of X* is 1 
and other components are 0) given previous status is Yi^i^k/di-i- 
From Assumptions 2.1 and 2.2, we have 

(2.7) = l 
and 

E{el) = E[E{el\E^-l)] = .E[E(l'D'X'XiDa|.F,_i)] 

= l'E[E(D'X'X,D,|.Fi_i)]l 

= l'E[E{Ti[dia.g{a-\Yi_i)T>i\Ei-i)]l 

(2.8) K K K 

q=lk=l 1=1 

<ck‘^, 

so that 

n n 

(2.9) an - n = ao -fE(^» “ 1) = 1 + E(^* “ E{ei\Ei-i)) 

2=1 2 = 1 

forms a martingale sequence. 

From Assumption 2.2 and n > 1/2, we have 


72 -\7- 

Y^-i t - ai-i 
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By three series theorem for martingales, this implies that the series 

OO -I 

ej — 1 

t=i 

converges almost surely. Then, by Kronecker’s lemma, 

1 ” 

1=1 

almost surely. This completes the proof for conclusion (b) of the theorem. 

The conclusion (a) is a consequence of conclusion (b). The proof of The¬ 
orem 2.1 is then complete. □ 

Assumption 2.3. Assume that (1.3) holds almost surely. Suppose that 
the limit generating matrix H, K x K, is irreducible. 

This assumption guarantees that H has the Jordan form decomposition 






(\t 

1 

0 ••• 

o\ 

/1 

0 • 

■ 


0 

At 

1 ••• 

0 

0 

Jl 

• 0 

with Jt = 





V 0 

0 • 

• jj 


0 

0 

••• At 

1 


T“^HT = J = 


0 

where 1 is the unique maximal eigenvalue of the matrix H. Denote the order 
of Jt by vt and r = max{Re(Ai),..., Re(As)}. We define v = max{ut: Re(Ai) = 

t}- 

Moreover, the irreducibility of H also guarantees that the elements of 
the left eigenvector v= {vi,...,Vp) associated with the positive maximal 
eigenvalue 1 are positive. Thus, we may normalize this vector to satisfy 

Remark 2.2. Condition (1.3) in Assumption 2.3 is very mild, just slightly 
stronger than Oi —> 0, for example, if the nonhomogeneous generating ma¬ 
trix Hj converges to a generating matrix H with a rate of for some 

c> 0. 

What we consider here is the general case where the Jordan form of the 
generating matrix H is arbitrary, relaxing the constraint of a diagonal Jordan 
form as usually assumed in the literature [see Smythe (1996)]. 


In some conclusions, we need the convergence rate of Hj as described in 
the following assumption. 
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Assumption 2.4. 

(2.10) \m-EUi 


o(rV2), 

+ 1 )), 


if r/i, 
if r = i, 


where ||(ajj)|| = random matrix (aij). 


A slightly stronger condition is 

( 2 . 11 ) \\Ui-EUi\\ = o{i-^/^). 


Remark 2.3. This assumption is trivially true if Hj is nonrandom. It is 
also true when H* is a continuously differentiable matrix function of status 
at stage i, such as Yj, Nj or the relative frequencies of the success, and so 
on. These are true in almost all practical situations. 


For further studies, we define 

f ^/n, if r < 1/2, 

= S \/^log^“^/^n, if t=1/2, 

[ log^"”^ n, ifr>l/2. 

Theorem 2.2. Under Assumptions 2.1-2.3, for some constant M, 

( 2 . 12 ) E\\Yn-EYr,f<MVl 

From this, for any k > r V we immediately obtain n“'^(Y„ — EY^) —> 
0, a.s., where a V 6 = max(a, 6). Also, if k = 1 or the condition (1.3) is 
strengthened to 

OO 

(2.13) 

i=i 

then EYn in the above conclusions can be replaced by nv. This implies that 
n~^Yn almost surely converges to v, the same limit of n~^EYn, as oo. 


Proof. Without loss of generality, we assume oq = 1 in the follow¬ 
ing study. For any random vector, write ||Y|| := V EYY'. Define yn = 
{yn,i, • • •, yn,K) = Y„T. Then, (2.12) reduces to 

(2.14) ||y„ - Fly^ll < M14- 

In Theorem 2.1, we have proved that \\an — nW"^ < CK‘^n [see (2.9) and (2.8)]. 
Noticing that Eon = n + 1, the proof of (2.12) further reduces to showing 
that, for any j = 2 ,..., K , 

(2.15) \\yn,j - Eyn,j\\ < MVn- 
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We shall prove (2.15) by induction. 

Suppose no is an integer and M a constant such that 


(2.16) 


i=nQ 


E U-i \ iU 


\og^ i 
5/4 


<e, 


i=no 


M = 


Cl + C2 + C3 + C4 + C^ + (C3 + 2 C^')Mq 

1 -3e ^ 


where e < 1/4 is a prechosen small positive number, Mq = max„<„u{||y„j — 
and the constants C’s are absolute constants specified later. 

We shall complete the proof by induction. Consider m > no and assume 
that ||y — Eyn\\ < MVn for all no <n <m. 

By (2.5) and (2.6), we have 


(2.17) 


+ E 

2=1 


2=1 

I ft?—1 ^ y2—1 

CLi — \ 


2=1 


UmJ — y0®m,0,j 


+E 


2=1 


I di—i ^ y2 —1 

di—i 


W R 

2-*-'m,2 j ? 


where Qi = Q^T, W* = T-i(Hi - H)T and 

= T~iB„,iT = (! + (/ + l)-i J) • • • (I + n-i J) 


/ 


(2.18) 


i=*+i 


n 1 +) 


j=i+l 

0 


0 

0 


0 0 ••• n (i+r'Js) 

V .=*+1 

and Bm,i,j is the jth column of the matrix B^.i. 

In the remainder of the proof of the theorem, we shall frequently use the 
elementary fact that 

A 


(2.19) 


j=i+l 


n 1+7 


n 


'ilj{n,i,X), 


where 'ip{n,i,X) is uniformly bounded (say < V’) and tends to 1 as / ^ 00 . 
In the sequel, we use 'ip{n,i,X) as a generic symbol, that is, it may take 
different values at different appearances and is uniformly bounded (by ?/>, 
say) and tends to 1 as z —> 00 . Based on this estimation, one finds that the 
(/i,/i + ^)-element of the block matrix nr=j+ 2 (I + J*) asymptotically 

equivalent to 

(2.20) log^(y)V’(n,J,Ai), 
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where At is the eigenvalue of Jt- 

By (2.17) and triangular inequality, we have 


112/m,j -®2/m,j|| 

< ||yoBm,0,j II + 


( 2 . 21 ) 


+ 


E 

2=1 

m 


2=1 

i CLi—\ 'N Yi—1 


JB 


^ 2—1 


m,2,j 


+E 

2=1 


I CLi—\ ^ Yi—1 
(li — \ 


W R 

2 -*-'m, 2 ,j 


+E 

2=1 


yt-i Wj - .Eyt_i Wj 


B 


m,i,j 


Consider the case where 1 + + • • • + ut-i < j <^ + I'l + ■ ■ ■ + Then, 

by ( 2 . 20 ) we have 

(2.22) ||yoBm,o,j|| < Cilm^* | log'^^"^ m < Cil/m- 

Since the elements of i7(Q*Qj) are bounded, we have 

( m >, 1/2 


(2.23) 




2 = 1 


= EBm,*,,^(Q:Q.)Bm,i,, 


. 2=1 


< C 2 < log^^"* 


1/2 


<C2vln! 


for all m and some constant C 2 . 

Noticing that a“3i||yj_i|| is bounded, for r / we have 


E 

2=1 


^ ^2—1 \ Yi—1 


-JB 


^ 2—1 




<E 

2=1 

m 


I di—l \ y2—1 

di—i 


-JB 


m,2j 


<i:c3r‘/^(|A,Ki)(!^)“'‘"\o.-‘(!^) 

771/ s, 

T.(t 


C 3 


2 = 1 


iog"*-M f) < c’sKa, 


for all m and some constant C 3 . 

Now we estimate this term for the case t = ^. We have 


E 

2=1 


I di—i ^ yi—1 

0-2 — 1 


-JB 


772,2 J 
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< E 

i=l 
m 

+ E 

i=l 


1 Oi-l \ I7,yi-1 1X3 
CLi-l 


^ ^i—1 


. O^i—l O^i—1 _ 


JB 


m,i,j 


First, we have 

m 

E 


2 = 1 




^2—1 




= E 


ik 


f E{i - ai-i){k - Qk-i) \ ^Yi-i ^Jk-i 

1 -u n 

\ IK / Oji—\ Qj}^—\ 


<ci E 

i<k<m 




ik 


1/2 


log‘ 


ut-1 


m \ ( m 


k y 




/ m 

[~k 


<C^m ^ m E 


1 


i<k<m 


kVik 


< C^m ^ m. 


Here we point out a fact that for any /r > 1, there is a constant C^> 0 
such that 


E\an-n\^^ 

This inequality is an easy consequence of the Burkholder inequality. [The 
Burkholder inequality states that if Xi ,..., Xn is a sequence of martin¬ 
gale differences, then for any p > 1, there is a constant C = C(p) such that 

E\ ET=iX^\P < CpE{EtiE{\Xf\\E^.,y/\] 

By using = i -|- and the above inequality, we have 

di — X 2 idi — x 
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log"- 


i=no+l 




2=1 


< (CsMo + C 3 + eM)\/mlog'^ ^ m. 

Combining the above four inequalities, we have proved 


(2.24) 


E 

2 = 1 


% (^i—\ \ y2—1 

(ii—\ 


-JB 


m,2 j 


<(C3Mo + C3 + eM)y™. 


By (1.3) and the fact that aj_\||yj_i|| is bounded, we have 


E 

2=1 


(2.25) 


% (li—\ ^ Yi—l 

Cli—l 


<'^€41 




2 = 1 


2=1 


< C4y/rnY^ - 

< C4Vm. 

Next, we show that 


Oi / m 
i \ i 


^\Re(At) 

—j log * 
-1/2 

log^*-^ 


m 

% 

m 

i 


E 

2=1 


yi_iWi - Eyi^iWi 


B 


(2.26) 


<E 

2=1 


yi i - Syi] 


m,2 j 


W B 

2 -*-'m, 2 j 


+E 

2=1 

m 

+E 

2=1 


^;(yi_i - ^;yi_i)Wj 


B 


m,2j 


Eyi-1 


{Wi - EWi)Bm,i,, 


< {2eM + C^{2Mq + l))Vm- 

By (1.3) and the induction assumption that ||yi-i — Eyi-i II < M'/i, 

m T-i 

y^-i -Eyi-i. 


E 

2 = 1 


W B 

2 -*-*m ,2 j 


no 


^ Re(At) 


i=i ^ ^ 


log' 


Pt-l 
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+ E MV.r^aA'^) 

I 1 V ‘ / 


i=no+l 
< {C^Mq + eM^Yrr, 
By Jensen’s inequality, we have 


\ Re(At) 


log' 


Pt-l 


E 

2=1 




<E 


Yi-i - Eyi 


W R 

2 -*-'m ,2 j 


2=1 

< [C^Mq + eM)Vm- 


The estimate of the third term is given by 

m T-1 

Eyi-1 


E 

2=1 




jn /™\Re(At) / 

<C5^||Wi-ii;w,|K-j log (- 


(2.27) 


2=1 


/ m / 


^ \ Re(At) 


< < 


2 = 1 
m 


KT) 


log' 




m 


i = l ^ ^ 

< C^Vm- 


^\ Re(At) 


log 


vt-1 


m 


if T / 


if r = 


The above three estimates prove the assertion (2.26). 

Substituting (2.22)-(2.26) into (2.21), we obtain 

\\yn,j ~EynjW < (3eMTCi + (72 + Cl's + (74 + C^ + [C^ + 2C^)MQ)Vm < MVm- 


We complete the proof of (2.15) and thus of (2.12). 

Since k > r V1/2, we may choose ki such that k > ki > r V 1/2. By (2.12), 
we have 


\\Yn - EYnf < 

From this and the standard procedure of subsequence method, one can show 
that 


n-"(Y„ - EYn) ^ 0 a.s. 

To complete the proof of the theorem, it remains to show the replacement 
of EYn by nv, that is, to show that ||2/nj|| < MVn if (2.13) holds and that 
Ill/njII = o(n) under (1.3). Here the latter is for the convergence with k=1. 

Following the lines of the proof for the first conclusion, we need only to 
change Ei/mj on the left-hand side of (2.21) and replace Fiyj_iWj on the 
right-hand side of (2.21) by 0. Checking the proofs of (2.22)-(2.26), we find 
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that the proofs of (2.22)-(2.26) remain true. Therefore, we need only show 
that 


E 

i=l 

(2.28) 


Ey^-lWi 


B 


m,2 j 




2 = 1 


= < 


rn / 

! —\ m 

V ^ ^ — 


t-1/2 


m 




ai ( m 


2=1 


I \ I 


T—1 


log 


log' 


ut-1 


m 

i 


This completes the proof of this theorem. 


y) < 0 (Cn), 

< em, 

□ 


if (2.13) holds, 
if (1.3) holds. 


Recall the proof of Theorem 2.2 and note that e can be arbitrarily small; 
with a slight modification to the proof of Theorem 2.2, we have in fact the 
following corollary. 


Corollary 2.1. In addition to the conditions of Theorem 2.2, assume 
(2.11) is true. Then, we have 

n 

(2.29) Yn- - Eyn- = QiBn,* _ + Op(14), 

2=1 

where y„ _ = {yn, 2 -, • • •, yn,K) and Bn,i- = • • •, Bn,i,K) ■ 

Furthermore, if (2.13) is true, Eyn - in (2.29) can he replaced by 0. 

Proof. Checking the proof of Theorem 2.2, one finds that the term 
estimated in ( 2 . 22 ) is not necessary to appear on the right-hand side of 
(2.21). Thus, to prove (2.29), it suffices to improve the right-hand sides of 
(2.24)-(2.26) as to eVm- The modification for (2.24) and (2.25) can be done 
without any further conditions, provided one notices that the vector yi_i in 
these inequalities can be replaced by (0,yi_i^_). The details are omitted. To 
modify (2.26), we hrst note that (2.27) can be trivially modified to eVm if 
the condition (2.10) is strengthened to (2.11). The other two estimates for 
proving (2.26) can be modified easily without any further assumptions. □ 


N„ = ^X, = ^(Xi 
2=1 2=1 


E{IL,\Ti-i))+ Y, 

2=1 


Y,i 


^2—1 


Note that 
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Since (Xj — E'(Xj|JPi_i)) is a bounded martingale difference sequence, we 
have 

n 

n-"^(X,-E(X,|^,_i))^0 a.s. 
i=l 


for any k > 1/2. Also, 

n 

EY.- 


n 


i-l\i 


2 = 1 




■-1+K1 A 


i=l 


AKl 


a.s. 


In view of these relations and Theorem 2.2, we have established the fol¬ 
lowing theorem for the strong consistency of N„. 


Theorem 2.3. Under the assumptions of Theorem 2.2, — ENn) 0, 

a.s. for any k > r V 1/2. Also, in the above limit, can be replaced by nv 

if K = 1 or (2.13) is true. This implies that almost surely converges 

to V, the same limit of n~^ E'Nn, n—>oo. 


3 . Asymptotic normality of In the investigation of the asymptotic 

normality of the urn composition, we first consider that of a^, the total 
number of balls in the urn after n stages. 


Theorem 3.1. Under Assumptions 2.1-2.3, n —n) is asymptoti¬ 
cally normal with mean 0 and variance an, where an = 12^=1 Vqdqki- 


Proof. From Theorems 2.1 and 2.2, we have that Ynfan —> v a.s. Sim¬ 
ilar to (2.8), we have 


n K K K 

n“^^var(ei|JP)_i) ^ EEE '^qdqkl 

i=l q=l k=l 1=1 


Assumption 2.2 implies that {e* — E{ei\Ei-i)} satisfies the Lyapunov con¬ 
dition. From the martingale CLT [see Hall and Heyde (1980)], Assumptions 
2.1-2.3 and the fact that 

n 

a„ - n = 1 -L ^(cj - E{ei\Ei-i)), 

1 = 1 


the theorem follows. □ 


Theorem 3.2. Under the assumptions of Theorem 2.2, V~^{Yn — EYn) 
is asymptotically normal with mean vector 0 and variance-covariance ma¬ 
trix where S is speeified later, = n if t < 1/2 and = 

nlog^^“^n if T = 1/2. Here r is defined in Assumption 2.3. 

Also, if (2.13) holds, then EYn can be replaced by nv. 
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Proof. To show the asymptotic normality of Y„ — EYn, we only need 
to show that of (Yn — EYn)T = yn — Eyn- 
From the proof of Theorem 3.1, we have 


n 

yn,i - Eyn, I = On - n - 1 = - E{ei\Ei-i)). 

i=l 


From Corollary 2.1, we have 


n 

yn,— Eyn,— — ^ ^ Qi^n,i,— T Op(V^). 

i=l 


Combining the above estimates, we get 


(3.1) 


yn - Eyn = 




2 = 1 


n—1 

^ ) QiBji j 2 

i=l 


n—1 


^ ) QjBfi j 


K 


+ o{Vn 


i=l 


Again, Assumption 2.2 implies the Lyapunov condition. Using the CLT 
for martingale sequence, as was done in the proof of Theorem 2.3 of Bai and 
Hu (1999), from (3.1), one can easily show that Vn^{yn — Eyn) tends to a 
iF-variate normal distribution with mean 0 and variance-covariance matrix 
(^T Sm)' variance-covariance matrix S 22 of the second to the ATth 
elements of U„~^(yn — Eyn) can be found in (2.17) of Bai and Hu (1999). 

By Theorem 3.1, for the case r = 1/2, Vn = -v/nlog‘"“^'^^ n, an = 0 and 
S 12 = 0. When r < 1/2, I 4 = ^/n, an = J2f=iJ2k=iJ2f=iVgdqki. Now, let 
us find S 12 . Write T = (T, Ti,..., T^) = (l',T_), T^ = (t'-^,..., and 
B„,j _ = T“^B„_jT_ = (B„,i, 2 , • • •, 'Bn,i,K), where 1 = (1 ,..., 1) throughout 
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this paper. Then the vector S 12 is the limit of 


n 


-1 


^cov[(ei,Qj)|J^i_i]TB„,j _ 


2=1 


= ^ 1 cov[(D'X', XiD,)|.Fi_i]TB„,i,_ 

2=1 

(3.2) = E 1 ( E + H*(diag(v) - v*v)H ) TB„,,,_ + 0^(1) 

i=l \q=l ) 

= 1 ( v,dg + H*(diag(v) - v*v)H ) Tn-i + Op(l) 

\Q=1 / i=l 

= 1 I E/E! + Op(l)i 

\i}=l / i=l 

where the matrices Aq are defined in (2.4). Here we have used the fact that 
lH*(diag(v) — v*v) = l(diag(v) — v*v) = 0. 

By elementary calculation and the definition of we get 

n 

""‘ESL.- 


2=1 


(3.3) 


( n 

^ ^E n (!+■?' 

2 = lji=2 + l 


V 


0 

0 


^ ^E n (!+•?’ 

i=ij=i+i 


In the hth block of the quasi-diagonal matrix 


^ ^E n (!+■?' 

2=1 ji=2 + l 


the {g,g -|-■^)-element {0 < i < I'h ~ ^) has the approximation 

1 / 77 ,^ . / n \ / 1 \ 


(3.4) 


Combining (3.2)-(3.4), we get an expression of E 12 . 

Therefore, n“^/^(yn — Eyn) has an asymptotically joint normal distribu¬ 
tion with mean 0 and variance-covariance matrix S. Thus, we have shown 
that 


n 


-i/2(y„ - EYn) ^ A^(0, (T-^)*ST-i) 
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in distribution. 

When (2.13) holds, y„ — nei has the same approximation of the right- 
hand side of (3.1). Therefore, in the CLT, EY'n can be replaced by nv. Then, 
we complete the proof of the theorem. □ 


Example 3.1. Consider the most important case in application, where 
H has a diagonal Jordan form and r < 1/2. We have 



/I 0 •• 

■ ° \ 

T"^HT = J = 

: 

o • 

0 


o 

o 

• ^K-l) 

(l',t'^, ... Now let 

K 


R = ^2 + H*(diag(v) 

1=1 

- v*v)H. 


The variance-covariance matrix S = {crij)ij=i has the following simple form: 


0-11 = IRl' = Eg=i Efc=i E«=i Vgdgki, (Tij = (1 - Aj_i) = (1 - Xj_i) Y.k= 

j = 2,..., K, and 

atj = (1 - Ai_i - A,_i)-i(t:_i)'Rt;_i. 


4. Asymptotic normality of Nr^. Now, = (A^ni, • • •, A^nx), where 
is the number of times a type-fc ball is drawn in the first n draws: 

n 

= {Nnl, . . ■,NnK) = N„-l + 

i=l 

where the vectors Xj are defined as follows: If a type-fe ball is drawn in 
the ith stage, then define the draw outcome Xj as the vector whose A:th 
component is 1 and all others are 0. Therefore 1X( = 1 and lN(j = n. We 
shall consider the limiting property of N„. 


Theorem 4.1 (for the EPU urn). Under the assumptions of Corol¬ 
lary 2.1, V~^(Nn- ENn) is asymptotically normal with mean vector 0 and 
variance-covariance matrix where S is specified later, = n 

if T <lj2 and = nlog^*^”^ n if t = \I2. Here r is defined in Assumption 

2.3. 

Furthermore, if (2.13) holds, then EN„ can be replaced by nv. 

Proof. At first we have 

n n 

Nn = ^(Xj - E(Xj| J-,_i)) + ^E(Xj|J-j_i) 

i=l 2=1 

n n—1 

= ^(X, - Yj_i/aj_i) + ^ Yj/oj. 

2=1 2=0 




(4.1) 
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For simplicity, we consider the asymptotic distribution of N„T. Since the 
first component of NjiT is a nonrandom constant n, we only need consider 
the other K — \ components. From (2.29) and (4.1), we get 


n—l 


N„T_ = “ Yi_i/ai_i)T_ + ^ yi-/ai 


2 = 1 


2=0 


= ^(Xi-Yi_i/a*_i)T_ 


2 = 1 


n—l 


n—l 


+ X] yi-/ (f +1) + X 


Yi- (i + l-tti 


i=0 


i=0 


i + l 


n—l . 


= X(X*-Yi_i/a,_i)T„ + X 


(4.2) 


2 = 1 


2=0 


Yi - f i + l-aj 
tti \ i+l 


+ yo,- 


n—l 




.1 + 1 

1=1 Lj=l 

n 

= X(X*-Yi_i/a,_i)T_ 


X + ^Yi- + OpiVi) 


2 = 1 


n—l /n—l 1 \ n—l ^ 


i=i 


. 1 =] 


= X(X* - Yi_i/a,_i)T + X _ + X + Op{Vn), 

1 -1 • 1 ^ ~r 1 

2 = 1 J = 1 2=1 

where Bjj = T“^Gj+i • • • GjT, the matrices with a 

minus sign in subscript denote the submatrices of the last K — I columns of 
their corresponding mother matrices. Here, in the fourth equality, we have 
used the fact that ~ '^p(v^) which can be proven by the 

same approach as showing (2.24) and (2.28). 

In view of (4.2), we only have to consider the asymptotic distribution of 
the martingale 

n n—l 

U = X(Xi - Yi_i/ai_i)T„ + X QiBnj,- 

i=l j=l 

We now estimate the asymptotic variance-covariance matrix of 
For this end, we need only consider the limit of 




-2 


(4.3) 


n—l 


X^(qHI-^f-i) + X ^(q,*QiBn,,,-|^i-i) 


n—l 


n—l 


+ X ^(BY _Q*q/.F,_i) + X ^(BX-B,B„,,- 

1=1 1=1 















20 


Z.-D. BAI AND F. HU 


where qj = (Xj — Yj_i/aj_i)T_ and Rj = = T*RjT. 

From Theorem 3.1, we know that 

E{q*qj\Ej-i) T!.(diag(v) - vV)T_ = Tl diag(v)T_ as j oo, 
since vT„ = 0. This estimate implies that 


(4.4) j=i 

^/TFdiag(v)T_, ifr<l/2, 


0 , if r = l/2, 

Because Qj = [X^-Dj — (Yj_i/oj_i)Hj]T, 


as j —> oo. 


(4.5) 


n—1 

Y- 2 ^i?(q*Q,B„,,-_|.T,_i) 

J = 1 

n—1 

= Y- 2 ^T^i?(X,-Y,_i/a,_i)* 

i=i ^ 

X [XjDj - (Yj_i/aj_i)Hj|.Fj_i]TB„j _ 

= T)ldiag(v)HT(^Y-2^B„,,-_^ +0(1). 


From (2.18), we have 


n—1 


n—1n—1 


(4.6) n-' = n-i ^(I + {j + I)"'J) • • • (I + r ^ J). 

1 ..... Z ~r J- 

j=i j=ii=j 

Based on (2.18)-(2.20), we have that the (h,/i + £)-element of the block 
matrix 


n 


n n—1 1 

1 i 


E E +O’ + !)■'JO • • • (I+JO 

... . Z “r 1 

j=i i=j 


has a limit obtained by 

n n—1 


n 


-1 


EEj 


1 1 a 


-At 


(4.7) 


j=ii=j 

2 rl ru 


log^(j) (1 + 0 ( 1 )) 




-1 


u 

I \ ^+1 


0 Jo 


-At 


log 


I ^ 


dudv 


1-A 


Substituting this into (4.6) and then (4.5), when = n, we obtain that 


n—1 


Y-2 ^ E(q*Q,Btt,,-,-|.F,_i) ^ S 2 = T*_ diag(v)HTJ, 
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where J is a x [K — 1) matrix whose first row is 0 and the rest is a 
block diagonal matrix, the t-block is ut x and its {h, h + £)-element is 
given by the right-hand side of (4.7). The matrix S 2 is obviously 0 when 
= nlog^*^”^ n. 

Note that the third term in (4.3) is the complex conjugate transpose of 
the second term; thus we have also got the limit of the third term, that is, 

Now, we compute the limit S 3 of the fourth term in (4.3). By Assump¬ 
tion 2.2, the matrices Rj in (4.3) converge to R. Then, the fourth term in 
(4.3) can be approximated by 


(4.8) 


n n—1 


E 7 n (I + r--'jpT*RT, 


j=li=j+l r=j+l 


n—1 -y n 


s 7 n (i+r-'j, 


i=j-\-l r=j-\-l 


g,h=l 


Similar to (4.7), we can show that the (rc, t)-element of the (g,/i)-block of 
the matrix in (4.8) is approximately 


2 (i/j)^«(m/j)^'^ log^'(i/j)log*'(m/j) 

(4.9) " ii + l){m + l){w')Kt'y. 

X [T*gRTh](nj-w' 

where [t;rt;,](^, t') is the (tc',t^)-element of the matrix [T*RT/i]. Here, 
strictly speaking, in the numerator of (4.9), there should be factors 'i/’(b j) ^0 
and Since for any jo, the total contributions of terms with j < jo 

is 0 ( 1 ) and the ijj's tend to 1 as j ^ 00 , we may replace the ijj's by 1 . 

For fixed w,w',t and t', if Xg / Xh or Re(Ag) < 1/2, we have 


1 (i/j)"«(m/j)"Mog"'(i/j)log^'(m/j) 

(z + l)(m + l)(u;0!(T)! 


(4-10) - E ^717^(1 - A,)-(“''-'+')(l - A, - A,)-(‘'+^+^) 

£=0 h 

t' 
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Thus, when r < 1/2, if we split S3 into blocks, then the (ic, t)-element of 
the {g, /i)-block x Vh) of S3 is given by 


(4.11) 


S S s^d --y- 

«,'=ot'=o L£=o 1 

+ S 7/^(1 - A.)-<‘'-'«>(l - 


When T = 1/2, Sg^/i = 0 if Ag / A/i or if Re(Ag) < 1/2. Now, we consider 
Sg^/i with \g = \h and Re(Ag) = 1/2. If w' + t' <2v — 2, then 


n—1n—1n—1 

EEE- 

j=i i=j i=j 


log"" (»/j)W i^/j) 

(i + l){i + l)(t(;')!(tO! 


n—1 n—1n—1 

<EEE 


log 


.w'+t' 


n 


^ 3Vil{w')\{t')\ 


< n log 


.w'+t'+l 


n = o{V^). 


When w' = t' = V — 1 which implies w = t = v = Vg = Vh^ by Abelian sum¬ 
mation, we have 


n—1n—1n—1 

K-^EEE- 


i/jY3{i/jY3\og’^ ^{i/j)\og^ \llj) 


v-\l 


(4.12) UUU (i + l)(^ + l)[(^-l)!]2 

-|A,|-^[(z.-1 )!]-2(2^.-1 )-i. 

Hence, for this case, S^/j has only one nonzero element which is the one 
on the right-lower corner of S^and given by 

(4.13) |A,|-2[(z. - 1)!]-2(2 i. - 1)-1[T*RT,](3,i). 

Combining (4.3), (4.4), (4.7), (4.11) and (4.12), we obtain an expression of 


S. 


□ 


Now we consider one of the most important special cases, where the matrix 
H has a diagonal Jordan form and r < 1/2. 

Corollary 4.1. Suppose the assumptions of Corollary 2.1 hold with 
T <1/2 and 




0 • 

0 

\ 

T 1hT = J = 

0 

Ai • 

0 



V 0 

0 • 

• Ax- 

-J 
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where T = Now let 

ttij = (t*_i)'(diag(v) - vV)t'_i, 

bij = Aj_i(l - Aj_i)"^(t*_i)'(diag(v) - v*v)t'_i 

and 

Cij = [(1 - Ai_i)-1 + (1 - A,_i)-^](1 - Ai_i - A,_i)-Ht:_i)'Rt;_i, 

for i,j = 2,...,K. Then re-i/2(N„ - ENn) is asymptotically normal with 
mean vector 0 and variance-covariance matrix where S = 

has the following simple form: 

(Til — ^ij — ^ii — d and ^ij — 
for i,j = 2,...,K. 

5. Applications. 


5.1. Adaptive allocation rules associated with covariates. In clinical tri¬ 
als, it is usual that the probability of success (here we assume that the 
subject response is dichotomous) may depend upon some observable covari¬ 
ates on the patients, that is, pik =Pk{f,i), where are covariates observed 
on the patient i and the result of the treatment at the ith stage. Here 
Pik = P(Ti = l|Aij = k, ^j), for i = 1,..., n and k = 1,..., K, where Xi = k in¬ 
dicates that a type-A: ball is drawn at the ith stage and Tj = 1 if the response 
of the subject i is a success and 0 otherwise. Thus, for a given the addition 
rule could be D(^j) and the generating matrices Hj = = i?D(^j). 

Assume that are i.i.d. random vectors and let H = ii'H(^i). 

The asymptotic properties of the urn composition Y„ are considered by Bai 
and Hu (1999). Based on the results in Sections 2 and 4, we can get the 
corresponding asymptotic results of the allocation number of patients N„. 
Here we illustrate the results by considering the case K = 2. 

Consider the generalized play-the-winner rule [Bai and Hu (1999)] and let 
P{Pkif.i)) =Pk-, A: = 1,2. Then the addition rule matrices are denoted by 


D(Ci) 


diiii) l-dl{C^)\ 
l-f^2('^i) d2{f.i) ) 


and H 


fPi 91 \ 

V 92 P2 / ’ 


where 0 < dk{f,i) < 1 and (/^ = 1 — pfc for A: = 1,2. 

It is easy to see that A = 1, Ai = pi -|-P2 — Ij t = max(0, Ai) and v = 
(92/(91 + 92), 91/(91 + 92))- Further, we have 

P _ (oi92+ a29i)(9 i+92)+ 9i92(pi - 92)^ / 1 -lA 

(91 + 92)2 V-1 ly’ 


T = 


1 91 

1 -92 


and T = 


1 


91+92 


92 

I 


91 

-1 
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where ak = Var((ifc(Ci))- For the case r < 1/2, we have that Vn = n and the 
values corresponding to Corollary 4.1 are 


022 = qiq2, 


b22 = 


(l-qi- q2)qiq2 


C22 = 


qi + q2 

2[(aig2+ a2gi)(gi+ 92 )+ gig2(pi - 92 )^] 
(91 +92)(i - 2(pi +P2 -1)) 


So 


~ _ , 2(1-^1-^2)9192 , 2[(aig2 + a29i)(9i+92) + 9i92(Pi-92)^] 

+ ta + ®)(l-2to+K-l)) 

From Theorem 2.3 and Corollary 4.1, we have 


n 


—a.s. for any (5 <1/2 and ^ A^(0, Si) 


in distribution, where 




T-i ^ 


022 


(9i+92)n-i 1 


1 -1 


For the randomized play-the-winner rule [Wei and Durham (1978)], we 
have ak = Pfc9fc) A: = 1,2. Then we have 


<722 = 


(5 - 2(gi +g2))gi92 
2(91 + 92) - 1 


This result agrees with that of Matthews and Rosenberger (1997). 

For the case r = 1/2, V/ = nlogn and the value corresponding to (4.11) 
is 

022 = 4[(oig2 + a29i)(9i + 92) + 9 i92(pi - 92)^]- 

We have 

(nlogn)“^/^(N„ - nv) ^ iV(0, S2) 
in distribution, where 

4[(aiq'2 + a29i)(9i + 92)+9i92(pi- 92 )^] / 1 -1 


S2 = 


(91 + 92)^ 


-1 1 


For the case of the randomized play-the-winner rule, we have 


S2 = 


49192 


(91 +92)^-! 1 


1 -1 










ASYMPTOTICS OF URN MODELS 


25 


5.2. Clinical trials with time trend in adaptive designs. Time trends are 
present in many sequential experiments. Hu and Rosenberger (2000) have 
studied time trend in adaptive designs and applied to a neurophysiology ex¬ 
periment. It is important to know the asymptotic behavior of the allocation 
number of patients in these cases. 

In Section 5.1, pik = P{Ti = l|Xj = k), where Xi = k if the fcth element 
of Xj is 1. There may be a drift in patient characteristics over time, for 
example, limj^oo= Pfc [Hu and Rosenberger (2000)]. Then the results 
in Sections 2, 3 and 4 are applicable here. For the case K = 2, we can get 
similar results as in Section 5.1. 

The results in this paper may also apply for GFU model with homoge¬ 
neous generating matrix with a general Jordan form as well as r = 1/2. In 
these cases, the results of Smythe (1996) are not applicable. 


5.3. Urn models for multi-arm clinical trials. For multi-arm clinical tri¬ 
als, Wei (1979) proposed the following urn model (as an extension of the 
randomized play-the-winner rule of two treatments): Starting from Yq = 
(Yoi) • ■ • i'Pok)-, when a type k splits (randomly from the urn), we assign the 
patient to the treatment k and observe the patient’s response. A success on 
treatment k adds a ball of type k to the urn and a failure on treatment 
k adds l/(Ar — 1) ball for each of the other K — 1 types. Let pk be the 
probability of success of treatment k, k = 1,2,..., K, and qk = 1 — Pk- The 
generating matrix for this urn model is 


/ Pi 
( A - 1 )- 1(/2 


{K-l)-^qi ••• (A-l)-igi\ 

P2 • • • {K -1) 1^2 


\{K-l)-^qK iK-l)-^qK ••• Pk 


The asymptotic properties of Y^ can be obtained from Athreya and Karlin 
(1968) and Bai and Hu (1999). From Theorem 4.1 in Section 4, we obtain 
the asymptotic normality of N„ and its asymptotic variance. 

Recently, Bai, Hu and Shen (2002) proposed an urn model which adds 
balls depending on the success probabilities of each treatment. Write = 
{Nni ,..., Nrix) and = (5^1,..., Suk)-, where N^k denotes the number of 
times that the kih treatment is selected in the hrst n stages, and Snk denotes 
the number of successes of the feth treatment in the N^k trials, k = l,... ,K. 
Dehne R„ = (R„i,..., Rnx) and M„ = Y.k=i ^nk, where Rn,k = ^ = 

1,... ,K. The generating matrices are 
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Pi 


Hi+i = 


Rji 


Ri2 


P2 


RjK \ 
RjK 

Mi-R,2'^^ 


Rji 

KMi-RiK"^^ 


Ri2 


QK 




Mi - RiK 


PK 
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In this case, H, are random matrices and converge to 
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Pi 

Pi 


P2 


M -p2 

Pi' 

^ M — pk 


92 


QK 


M — Pi 
P2 


P2 


M -pK 


91 


QK 


PK 


M — Pi 

Pk 

M -p2 

PK 


91 ^ 

92 


/ 


almost surely, where M = pi + ■ ■ ■ + px ■ 

Bai, Hu and Shen (2002) considered the convergences of and N„/n. 
The asymptotic distributions of and can be obtained from Theorems 
3.2 and 4.1 in this paper. Prom Lemma 3 of Bai, Hu and Shen (2002) we 
have ai = o(i“^/^) almost surely, so the condition (1.3) is satished. 
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